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Abstract. Wc show how to bound the accuracy of a family of semi-definite 
programming relaxations for the problem of polynomial optimization on the 
hyperspherc. Our method is inspired by a set of results from quantum in- 
formation known as quantum de Finetti theorems. In particular, we prove 
a de Finetti theorem for a special class of real symmetric matrices to estab- 
lish the existence of approximate representing measures for moment matrix 
relaxations. 
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1. Introduction 

In this paper we will discuss the solution of polynomial optimization problems 
of the following form: What is the maximum value v of a polynomial T{x) = 
T{xi, . . . ,Xn) in n real variables evaluated on the hypersphere S"^^ ? More pre- 
cisely, our aim is to solve the optimization problem 

maximize T{x) 
subject to ||a;||2 = 1 , 

with II a; II 2 = Optimizing polynomials over the sphere or other compact con- 

vex sets K has a large number of applications ranging from material sciences [3H] , 
quantum mechanics |14j , numerical linear algebra, and signal processing to portfolio 
optimization (see |21| for an extensive list of applications and references). Exam- 
ples of such problems within theoretical computer science itself include finding the 
maximum stable set {K being the simplex) or maximum cut {K being the Boolean 
cube) in graphs (see e.g. [TT] for a survey). 

Not surprisingly, not all K pose an equal challenge when it comes to optimization. 
For example, when K is the simplex it has been known for some time that one can 
approximate the optimum of T{x) over K in polynomial time if T is quadratic [3] , or 
more generally if T is a homogeneous polynomial of a fixed degree [12] (see also [TT] ) . 
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The case of optimizing over the sphere, however, remains poorly understood in 
comparison [TT]. Whereas the problem is easy if T has degree at most 2 [TT], it 
is known that already for degree 3 the problem of optimizing a polynomial over 
the sphere is NP-hard [31]. This leads to the question of whether polynomial 
time approximation algorithms do exist. For example, it was questioned in [T^ 
whether there exists a polynomial time approximation algorithm for homogeneous 
polynomials over the sphere. Adhoc approximation algorithms are known for a 
variety of special cases, for example for optimizing biquadratic polynomials 
or for optimizing cubic polynomials |42| . Recently, a poly time approximation 
algorithm involving random sampling for optimization of homogeneous polynomials 
over the sphere was proposed in |21j . 

There is a general approach to solve any such problem approximately by means 
of a hierarchy of semidcfinite programs (SDP) [321 US] (see [55] for an excellent 
survey). Each level of this SDP hierarchy is of increasing size, and thus more 
difficult to solve computationally. Yet, solving level I of the hierarchy yields an 
approximation that improves for increasing I and is known to converge to v 
due to theorems in algebraic geometry. However, results that bound the quality of 
the approximation for finite i are relatively rare. For optimizing over the sphere, 
using results of Rcznik |36j , Faybusovich |16j derived approximation guarantees for 
the SDP hierarchy when optimizing over the sphere, which leads to a meaningful 
approximation for similar values of ^. However, our approach yields rather new 
insights and an explicit representing measure for every Results are also known 
for the special case that the number of variables n is at least twice the degree of 
T{x) [32| . In the computer science literature, results are also known for the special 
case when the polynomial is of a certain form and has degree 4 [2]. 

Result Here, we prove convergence of a hierarchy of semidcfinite programs for 
optimizing polynomials T on the sphere. More precisely, we show that for any 
homogeneous polynomial T the solution Vf^ of the SDP at level £ approximates the 
true optimimum v as 

Vi- e{n,a,t)v <v <vi , (1) 
where the relative error is given by 

4a2 (a + f + l) 
2£+l 

with a = \d/2\. Since any polynomial of even degree at most on the sphere can 
be turned into a homogeneous polynomial of degree exactly d our result also applies 
to such polynomials. To our best knowledge, these are the first general results about 
convergence of the SDP hierarchies |33[ 126] when optimizing over the sphere that 
yield explicit representing measures. Note that in polynomial optimization the 
degree of the polynomial is generally regarded as a constant, and we care about 
the performance of the algorithm in terms of the number of variables n [llj . These 
bounds do not imply a polynomial time approximation algorithm for optimization 
over the sphere. However, we emphasize that in contrast to other results on the 
convergence of the hierarchy of semidcfinite programming relaxations our results 
hold for any homogeneous T{x), and our approach yields an explicit approximating 
representing measure at any level L 

To prove our results we employ rather different techniques than have been used 
in the past, inspired by results in quantum information theory known as dc Finetti 
theorems. In particular, we prove a novel dc Finetti theorem for a special class of 
real valued matrices. 



"'^I.e., each monomial tm{x) in T{x) = ^rn{x) has oven degree. 
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2. Basic concepts 

Our construction makes essential use of results from quantum information, rep- 
resentation theory, and spherical harmonic functions. In Section \2A] we review the 
necessary ingredients from quantum information. In Section 12.21 we give some es- 
sential background from representation theory. We continue to summarize facts 
of spherical harmonics in Section 12.41 Finally, we define the notion of maximally 
symmetric matrices in Section [2.51 

2.1. Quantum information. 

2.1.1. Notation. Throughout, we use notation that is commonly used within quan- 
tum information which served as an inspiration for our proof. Consider a vector 
space V = C" or V = K" of dimension n 0. We use \v) G to denote a vector 
in V, and {v\ to denote its dual vector, the complex conjugate transpose of \v). 
Let End(F) denote the set of linear operators on V. For any matrix M G End(y) 
we will use M* to denote its complex conjugate and M'^ = (M*)'^ to denote its 
complex conjugate transpose (sometimes denoted by M* in other fields). For any 
matrix M G End(F), we use tr(M) to denote the trace of M, that is, the sum of 
its diagonal entries. The identity is denoted by 1. 

2.1.2. States, systems and tensor products. A (quantum) state is a matrix M sat- 
isfying M > and tr(Af) — 1. We also consider vectors spaces of the form V (}^V, 
where (8) denotes the Kronecker product, also known as the tensor product. When 
considering the tensor product V^^ = V ^ . . . ot £ vector spaces, we will also 
refer to the individual spaces V as systems. If |a;) € V and \y) € V then the 
vector \x) ^ \v) £ V ^ V . If it is clear that from context that we are considering 
tensor products of vectors on different systems, we follow the common convention 
in quantum information and employ the shorthand 

\xy) := \x) ® \y) . (3) 

When amgiuity is possible, we also use \x,y) := \xy). 

Example 2.1. Consider two 2-dimensional vectors spaces = V2 C^, where 
{I'^i}) I ''^2)} and{\vi), \ v2)} o,i"e orthonormal bases for Vi and V2 respectively. A basis 
for the space Vi ® V2 is given by {\viVi), \v1V2), \v2V1), \v2V2)} where for example 

\vivi) \vi) (g) = (a;iii,a;ii2, 2:2*1, a;2i2)^ , (4) 

with \vi) = {xi,X2) and \vi) = {xi,X2)- 

2.1.3. Partial trace of matrices. An important notion we will need later is the so- 
called partial trace operation. When considering polynomials as matrices we will 
see that this operation is analogous to applying the Laplacian to a polynomial. 
Consider the tensor product of spaces U (E) W. For a matrix M E End(?7 (X) W) we 
can define trw{M) € End(J7) the partial trace of M over W such that 

tr [{N (8) I)M] = tr[NtrwiM)] VA^ G End(C/) , (5) 

We can write an explicit formula for the components of trw{M). Let and 
be orthonormal bases for U and W respectively. We can express any matrix 
M G End(C/ W) in terms of this basis as 

M = ^ mu,u'.w.w'\u){u\(g}\w){w'\ , (6) 

u,u' ,w' 



All spaces in this note are of finite dimension. 
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for some coefficients mu,u',w,w' G C. The partial trace of M over W can be shown 
to be as follows 

UwiM):^ m,,,u'.w.w'\u){u\(^tr{\w){w'\) (7) 

= i'^my.yf^^^^] \v){v'\ . (8) 

u,u' \ w / 

We also speak of tracing out W from M to obtain trw{M) G End(l7). When 
U = Y'^o-^ w = yf^b^ g^j^^ ^ a + 6 we also write the partial trace as 

tr;„(M) := tra+i,...AM) := trwiM) , (9) 

to emphasize the number of systems we trace out. It is sometimes useful to note 
that the partial trace can also be written as 

trwiM) = ^(I (g) {w\)M{l (g> \w)) , (10) 

w 

where the sum is taken over basis vectors \w) of an orthonormal basis on W. 

Of particular interest to us will be the partial trace of Hermitian matrices M 
such that M > and tr(M) = 1, i.e., states. When tracing out a number of systems 
of a state AI £ End(y*^) we also call Ma := tra+i ...^^(M) a reduced state of M. It 
is clear from the definition of the partial trace above that tr(Ma) = 1. Similarly, 
note that Ma > for any state M since {v\Ma\v) = tr {{\v){v\ (g) I'*^)M), where I 
is the identity on V and b = £ — a. 

Example 2.2. To get some intuition about the partial trace operation, consider the 
following two examples. First, consider a state M G End(V^'*^) where M = Mi®M2 
and Ml, M2 S End(l^) being states. When tracing out the second system we simply 
obtain tr2(M) = Mi tr(M2) = Mi. Second, let us consider a state M that cannot be 
written as a tensor product of state on the individual systems. Consider M = 
for l^*) = (|11) + |22))/\/2 where {|1), |2)} forms an orthonormal basis for V with 
dimension dmi{V) ~ 2. As a matrix. 



1 



/ 1 1 \ 





\ 1 1 y 

_ 1 
^ 2 

By the definition of the partial trace ([7]) we have 



(11) 

(|1)(1| ® + |1)(2| ® |1)(2| + |2)(1| ® |2)(1| + |2)(2| ® |2)(2|) . (12) 



tr2 (A/) = i (|1)(1| ® tr(|l)(l|) + |1)(2| ® tr (|1)(2|) (13) 
+ |2)(l|®tr(|2)(l|) + |2)(2|®tr(|2)(2|)) 

= i(|l)(l| + |2)(2|) (14) 



(15) 



2.1.4. Matrices as vectors. When considering the case of real valued variables, it 
will be useful to note that any matrix AI £ End(y^^) can be identified uniquely 
with a vector \M) € y'^'^^. Let denote the standard orthonormal basis 

of V. Note that the set of matrices forms a basis for End(y) in which we can 
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express M as 

Af = ^m,,|z)(j| , (16) 

for some coefficients m^j e R (or rriij e C for y = C"). Relative to our chosen 
basis, we can hence define an isomorphism between matrices and vectors given as 

|A/):=^m,,|z)(b-))* . (17) 

Throughout, when concerned with matrices M we will use \M) to denote its vec- 
torized form. It will be useful to note that for this mapping we have for any \x) £V 

{x\M\x) ^ {x\®{{x\)*\M) . (18) 

2.1.5. Norms and a distance measure. Finally, we will use two particular norms on 
the set of operators End(V^). The ffi'st is the operator norm which for Hermitian 
matrices N is simply given by ||-/V||oo = Amaxd-^l), where Amaxd^l) denotes the 
largest eigenvalue of |7V|. We will also need its dual norm, the Ll-norm, which for 
any matrix M G End(y) can be expressed as 

sup tr(iVM) . (19) 

\\N\\^<1 

The Ll-norm leads to a very useful distance measure on the subset of states in 
End(y). In particular, for two quantum states Mi and the following quantity 
forms a distance 

D{Ah,M2) IIM1-A/2II1 , (20) 

also known as the trace distance. Immediately from its definition it is clear that 
the trace distance has a useful property. Suppose two states Mi, A/2 S End(T^) are 
e-close in trace distance, i.e., D{Mi, M2) < e. We then have for any matrix N that 

|tr(7VAfi)-tr(7VA/2)| <e||iV||o, . (21) 

2.2. Representation theory and the symmetric subspace. We will make ex- 
tensive use of tensor product spaces and various elements of representation theory 
of the unitary and orthogonal groups (see e.g. [5D] for an in-depth introduction). 

2.2.1. Unitary and symmetric groups. The space V'^^ with V = C" carries a natural 
representation of the group of permutations of £ objects and the unitary group 
11(71) with dimension n ~ dim(y). Consider the representation of Si on V'^^ given 

by 

TT ® . . . ® = |a;("(i») ® . . . ® jx^'^^") , (22) 

for TT Cz Se. Similarly, consider a representation of U(n) given by the tensor products 
of the defining representation of U(n) on V as 

g (E)...(E) \x^'^)) = (E)...(E) <?|x(^)) , (23) 

for g g U(n). Clearly the actions, of Si and U(n) commute so that this represen- 
tation on V^^ will break up into irreducible representations of the direct product 
group Se X U(7i). 
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2.2.2. Symmetric subspace. The symmetric subspace is the subspace of cor- 
responding to the trivial representation of Sg. The projector onto the symmetric 
subspace is given by 



n.^^E- (24) 



As a result we may write the symmetric subspace as 

Sym(y^'') = {M e End(1^^0 | H^MH^ = M} . (25) 

It is a standard result that the restriction of our representation of U(n) to the 
symmetric subspace is irreducible. Note that any vector of the form \z) :— \x)'^^ 
obeys tt\z) = \z) for all tt and thus n^jz) — |z). So |z) lies entirely the symmetric 
subspace Sym(l/®^). 

2.2.3. Number state basis. A convenient basis for the symmetric subspace is given 
by the so-called number state basis {|i)}i with 

|i):=-i=E^(|l)''^^®---®l«r") : (26) 

where i := (ii, . . . , i„) with ^jij = i, il = and {|1), . . . , |n)} is an 

orthonormal basis for V . Note that the projector into the symmetric subspace can 
also be expressed in the number state basis as 11 = |i)(i|. Furthemore, note 
that the set of vectors {7r|wk) | |wk) := |xi)®''i (g) . . . (g) |a;„)'^''" with tt e Se} spans 
(C")*^^. Hence any matrix M e End((C")'*^) can be written in the form 

M= E f^:T^\vi.){vm\<y , (27) 

k,m 

for some coefficients /^'™. It will be useful to understand what matrices written in 
this form look like when projected into the symmetric subspace. We can compute 
lieMIie by computing the terms 

(i|7r|wk)('Um|cr|j) = {i\vk){Vm\j) = CiCj i!j! (5i,k5j,m , (28) 

where we used the shorthand 

^ (29) 



It is also interesting to consider what happens to the matrix |i)(j| if we trace out 
one system from (C")*^^. We have 

tri[|i)(j|] = CiCj (30) 
E I E tn [(£,mi)|l)«'i...|n)®^"(l|®^^..(n|«^"(£,m2)] 

n 

= CiCJ E E**-?*|l)^'' • ■ • \^)®''~^ ■ ■ ■ • • • {t\®^*~^ . . . 

(31) 

1 " 

with i — et = (ii, . . . , It — 1, . . . , i„), where the first equality stems from the fact 
that we can write any permutation over i elements as a swap between the last and 
any of the others, followed by a permutaton over only ^ — 1 elements. 
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2.2.4. Orthogonal group. While the unitary group wiU be sufficient to deal with the 
case of polynomials on the complex projective space, in the case of real polynomials 
on the sphere we will also need to deal with representations of the orthogonal group. 
As before the space V'^^ carries a natural representation of the orthogonal group 
as follows 



g [^x"-'^) ®... Ix^'^^) j = fflx^^O ®---® 9V^n ■ (33) 

for all g G 0(n). 0{n) is a subgroup of U{n) and again this representation commutes 
with the representation of Se that permutes copies of V. Once again we will be 
able to restrict our attention to the symmetric subspace Sym(y*^). However the 
symmetric subspace is no longer irreducible as a representation of 0{n) but rather 
decomposes into into a number of irreducibles (see e.g. [HJ Chapter V] or [151 
Chapters 17.3,19.5]). 

2.3. Polynomials. 

2.3.1. Real variables. We will ffist consider the case of real polynomials. For a set of 
real variables x G M" a general monomial of order d can be written as x\^ ---x^^ = 
with |i| = d. A general polynomial degree at most d can therefore be written as 

T{x) = aix' , (34) 
|i|<d 

for some coefficients ai G M. We will largely be interested in evaluating these 
polynomials on the hypersphere where the variables xi, . . . ,Xn satisfy 

n 

r{xr:=J2x^^l. (35) 

i=l 

Let us now first consider the case where all terms in T{x) have even order. We can 
then use the fact that we are evaluating polynomials on the sphere to write T as a 
sum of monomials of degree exactly d since for any i with \i\ ~: d' < d we have 

aix' = aix\{xf~'^' , (36) 

which is a monomial of degree d when d is even. Note that this turns any such 
polynomial into a homogeneous polynomial of even degree. 

Let us now consider homogeneous polynomials T(x) of odd degree d = 2a — 1. 
As we show in the appendix (Lemma IB.2p . there exists an explicit constant 7(a) 
such that 

max T' (a:') = 7(a) max T{x) , (37) 

where T'{x') is a homogeneous polynomial of even degree. Solving the l.h.s. thus 
allow us to obtain the solution of the r.h.s. Throughout, we will thus always assume 
that T has been transformed into a homogeneous polynomial of even degree exactly 
2a. 

We define the Laplacian in the usual way as follows 

^^(-) - E S^(^) (38) 

i * 

which maps a polynomial of degree d to a polynomial of degree d — 2. Harmonic 
polynomials are those for which AT(.t) = 0. Real polynomials carry a natural 
representation of the orthogonal group as follows 

T{x) ^ Lg[T]{x) :^T{g-^x) ^g e 0{n). (39) 

While this is not an irreducible representation, it is well known that when restricted 
to the harmonic polynomials it becomes irreducible [HJ [THl [Ml 130] ■ 
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2.3.2. Complex variables. We will also consider polynomials T{x) in complex vari- 
ables X = (cci , . . . , a;„) g C". This is because our techniques originate from quantum 
information and turn out to be somewhat simpler for the case of complex variables. 
In this case it is natural to consider polynomials in x and x* with monomials of 
the form x*i^x'^^ . . . x*'"a;j^a;2^ ■■■x^n — a;*'.T-'. When |i| ~ ac and |j| = or this 
is a monomial of order (acau). A general polynomial will then be of the form 
T{x,x*) = Q!ja;*'a;j. We will be interested in evaluating these polynomials on 
the complex projective space CP"^^ of points x G C" where /3a; is identified with x 
for all /3 ^ G C. The monomial x*^x^ is a function on CP"~^ only if |i| = |j| = a 
and we call this a monomial of symmetric degree d = 2a. Moreover for the max- 
imization to make sense we need T{x^x*) to be real for all x and we call such 
polynomials Hermitian. A polynomial T of order (a, a) is Hermitian if T can be 
expanded in monomials of symmetric degree at most d as follows 

T{x,x*)= a]x*'x> , (40) 

|i| = |j|<m 

for some coefHcients G C satisfying (aj)* = a\- 

The polynomial function r(x, x*) is not invariant under a; — > ca; and is therefore 
not yet a function on CP"~^. To resolve this issue we define a normalised vector 
X = x/\\x\\2 which is invariant under x — > cx for positive real c. As described above 
we have already restricted attention to Hermitian polynomials of symmetric degree 
that are invariant when c = exp(i0) for any angle 9. As a result we may define the 
following function on CP"^^ 

TcP^-i{x,x*)^T{x,x*). (41) 

When we refer to optimising a polynomial function over complex projective space 
we mean that we optimise Tcpn-i(a;, a;*), which is properly a polynomial function 
of X rather than x. However this distinction will usually be unimportant 

As in the real case wc can use the fact that we may embed CP"~^ in the set of 
normalised vectors \x) such that 

r(a;)2 = |(x|x)p=^.x^a-, = l. (42) 
fe 

So we may write T as a sum of a sum of monomials of symmetric degree exactly d 
since for any i and j with |i| = |j| =: a' < a we have 

a]a:*'a;j a]x*'x^r{xf^''-'''^ , (43) 

and this is a sum of monomials of symmetric degree 2a. Throughout, we will always 
assume that T has been transformed in this fashion. 

Another natural operation we can perform on a Hermitian polynomial is to act 
with the Laplacian 

Tix.x*) -> AcTix,x*) - V A^T(x,x*) (44) 

which maps a Hermitian polynomial of degree 2a to a Hermitian polynomial of 
degree 2a — 2. Notice that if we write a; = a;R -I- ixc in terms of real and imaginary 
parts then r(a;,a;*) is a polynomial in the 2n real variables (a:R,a;c)"^ and the 
Hermitian harmonic polynomials of symmetric degree 2a in n complex variables 
become a subset of the harmonic polynomials of symmetric degree d = 2a in 2n 
real variables. There is a natural representation of the unitary group U(n) C 0{2n) 
on the Hermitian polynomials of symmetric degree d as follows 

T{x,x*) ^ Lg[T]{x,x*) ^T{g-'^x,gx*) V.g € U(n). (45) 
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As in the real case this is not an irreducible representation but becomes so when 
restricted to the harmonic Hermitian polynomials of symmetric degree 2a [55] , 

2.4. Spherical harmonics. Our result for real valued variables depends on facts 
of spherical harmonic functions (see e.g. j30j for an in-depth introduction). It should 
be noted that these can be defined in a rather general way (see e.g. |3S]): however 
here we only require spherical harmonic functions on the real sphere S^'-^ C R" 
and hence we restrict ourselves to this case. 

2.4.1. Functions on the sphere. Let us first consider general functions on the sphere 
5'"--i. Wc will use dx to denote the measure on S'""-'^ that is normalized to unity, 



dx=l . (46) 

Let ^^(5*"^^) denote the space of square-integrablc functions on 5""^ with respect 
to the measure dx H. It is a Hilbert space under the natural inner product 

if, 9) ■■= I .f{x)9{x)dx . (47) 

It is well known (see e.g. [SHI US]) that the space of functions can be decomposed 



as 

oo 

l2(5"-1)^0H,(5"-1) , (48) 

where 'Hj{S^~^) is the space of all so-called spherical harmonic functions deter- 
mined by the restriction of all homogenous harmonic polynomials of degree j on 
M" to the sphere. The space 7ij{S^^^^) inherits a natural representation of 0{n) 
from the corresponding representation on the harmonic polynomials, and this rep- 
resentation is still irreducible. For each Hj{S"'~^) there exists a basis of functions 
given by the spherical harmonics Sjm- There exist 

linearly independent spherical harmonics Sjm of degree j. We choose this basis as 
in [30] to be orthogonal so that 

[ Sj,n{x)Sj,,n'{x)dx ^ Sjj'Sjnm'— , (50) 

where 

27r"/2 

Uln 7 — T (51) 

r(f) 

is the surface area of the sphere S""^^ when integrating over the usual Cartesian 
measure in R". Furthermore, any spherical harmonic of degree j can be expressed 
as a linear combination of the Sjm ■ 

In terms of this basis, any (continuous) function / £ L^(S'"^^) converges uni- 
formly to 

oo N(],n) 

fix) =J2Y1 fi^riSj,n{x) . (52) 
j—0 m—1 

Note that this is the generalization of the Fourier transform on the circle to the 
sphere. The coefficients fjm are thus also referred to as Fourier coefficients. 



■^A function is square-integrablc with respect to dx if ||/||2 = {fgn-i \f{^)\^d^)^^^ < oo. 
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Very roughly, the fact that L?{S^^^) admits a decomposition of the form (^5)) 
can be understood as a consequence of the fact that by the separation of variables 
theorem for polynomials j40j , any homogenous polynomial of degree d can be writ- 
ten as a sum of terms each of which consists of a harmonic polynomial of degree 
d—2k multiplied by a radical polynomial r^'""(x). Restricting to the sphere sets the 
radical terms to unity and a general polynomial can be expanded as a sum of ho- 
mogenous harmonic polynomials of degree up to d alone. The fact that any function 
can be decomposed as can now be understood as a consequence of Weierstrass' 
theorem stating that the algebra of all polynomials is dense in L^(S'"~^). Intu- 
itively, this means that any function can be approximated by a sum of polynomials 
of possibly high degree. Considering the origins of said composition, however, it is 
instructive to note that when / in (|52p is a polynomial of degree up to d, we will 
also only observe terms up to j < d. 

2.4.2. Funk-Hecke formula. We will need one important property of the spherical 
harmonic functions. Consider a spherical harmonic fj G %j{S"'~^) and \y) £ S"*"^, 
then 

/ ((a;|y))2V^-(a;)dx = ^A(n,^,jU-(2/) : (53) 

where X{n,£,j) is a normalization factor as defined in Lemma |A.1I This equation 
is a special case of the Funk-Hecke formula |30] applied to the continuous function 
Note that the appearance of the factor 1/cj„ arises from converting [501 
Theorem 6] to account for the fact that, in contrast to Miiller, we are using a nor- 
malised measure on the hypersphere. The Funk-Hecke formula can be understood 
as providing the normalization factor of Schur's lemma for the representation space 

2.4.3. Norms. The set of functions on the sphere admits a set of so-called p-norms. 
Here we will merely need the oo-norm of a function F on the sphere given by 

||F||oo :=^maxj^^(x)| . (54) 

2.5. Maximally symmetric matrices. As noted above we will often need to dis- 
cuss matrices on the symmetric subspacc M G End(Sym(l/®^)) for which H^ AfH^ = 
AI. It can be shown that all matrices on the symmetric subspace in fact satisfy 
JIiM = M and such matrices are sometimes termed Bose symmetric in the quan- 
tum information theory literature. However, in the case of real polynomials we will 
need to consider a set of matrices with an even stronger set of symmetries. 

2.5.1. Definition. We will call M a maximally symmetric matrix if 

7r|Af) = \M) for all t: e 821 , (55) 

where \M) is the vector form of the matrix M as defined in Section [2.1.41 and tt 
is a permutation of the copies of V^^^ as disussed in Section 12.21 Another way 
of phrasing this condition is that H2^|M) = \M). This condition also implies that 
ttM = M for all n (1 Se and so M is indeed a matrix on Sym{V'^^. For the reader 
from quantum information theory, we note that M is not only Bose-symmetric but 
also PPT-symmmetric, i.e., invariant under partial transposes of any subset of the 
£ systems. It is also helpful to note that any maximally symmetric matrix is real, 
since its definition also implies that M = M^. Throughout, we will use 

MSym((M")®0 := {M e End((M")®^ | Ha^jM) = |M)} (56) 
to denote the subspacc of maximally symmetric matrices of End((R")'^^). 



'Notice that this convolution integral commutes with the action of 0(n) on 'Hj{S^ ^). 
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2.5.2. Spherical harmonic matrices S^^. For every spherical harmonic Sjm we as- 
sociate a maximally symmetric matrix, a spherical harmonic matrix, defined as 

S'j^ := / s„n{x)\x}{x\^'dx . (57) 

Clearly we have S'j^ € MSym ((R")**^). We will need to establish several prop- 
erties of the spherical harmonic matrices as defined in ([57]) . The first lemma 
shows that they are in fact orthogonal with respect to the Hilbert-Schmidt inner 
product. It also establishes their normalization. In essence, our first lemma is an 
easy consequence of the Funk-Hcckc formula given in (j53p . 

Lemma 2.3. Let £,j,n E N with n > 3 and j < 11, then for matrices and 
S'j,„,, e MSym((M")®'^) as defined m we have 

tr = 5,y5,nn,^ A(n,^,j) , (58) 

where the normalization factor \(n,i,j) is defined in Lemma \A.l[ 
Proof. Using the fact that the trace is linear and cyclic we have 

tr (5^,„5|,,„,) - / / sj^{x)s,,^,{z)tr{\x){x\'^'\z){z\'^')dxdz (59) 

Sjm{x)sjim'{z){{x\z))'^^dxdz . (60) 



Since t'^^ is continuous for —l<t<l and sjm is a spherical harmonic of degree j, 
we may apply the Funk-Heckc formula [30l Theorem 6] with t = {x\z) (see also ((53)) 
to rewrite 

Sj'm'{z){{x\z))'^^dz = '^^^^X{n,i,j)Sj'm'{x) . (61) 

Combining ([50]) with (pD) , and using the orthogonality of the spherical harmonics 
then gives 

tr = ^^^^^^^^^^^ / s,mix)s,,m'{x)dx (62) 



(63) 



as promised. □ 

In addition to considering the trace of products of spherical harmonic matrices, 
we will also need to know the trace of any such matrix by itself. This is an easy 
consequence of the lemma above and properties of the spherical harmonics Sj„i{x). 

Lemma 2.4. Let £,j,n € N and n > 3, then for all j > the matrix defined 
in ([57)1 satisfies tr (Sj^^) ~ 0. Furthermore, tr (S'q) = 

Proof. The first part follows immediately from the fact that sq(x) is the constant 
function and the orthogonality of the spherical harmonics 

ti" = / Sjm{x)dx (X / Sj,n{x)so{x)dx = . (64) 

The second part follows from the fact that so{x) = c is given by a constant c, which 
is determined by the normalization condition of (|50p 



so{x)^dx = / = = — . (65) 

Hence, sq{x) = 1/ y/UJ^i which together with (IMl) implies our claim. □ 
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3. Polynomials as matrices 

Wc follow the general approach of Parrilo [33] and Laserre [55] to express the 
class of polynomial optimization problems that we are interested in as a hierarchy 
of semidcfinite programs (SDPs). To use this approach, we will need to transform 
the problem involving polynomials into an equivalent problem involving matrices. 
Positivity of the matrices is related to the existence of sum of squares decompo- 
sitions for the polynomial. The specific mapping that we use here appears to be 
somewhat different to the one that is usually used but we will see that it has sig- 
nificant advantages when it comes to analysing the convergence of the sequence of 
SDPs. 

3.1. Matrices for polynomials in complex variables. As a warmup, wc first 
discuss the case of Hcrmitian polynomial functions T{x,x*) of complex variables 
X G CP"~^. This case already captures some of the essential ideas, but is slightly 
simpler than the case of real variables. 

3.1.1. Construction of matrices. In order to map a polynomial to a matrix, consider 
a vector space V — C" with orthonormal basis {\j),j G 1, . . . , n}, where we identify 
each basis element \j) with a particular variable Xj. Products of variables can then 
naturally be identified with the basis of V^°' given by the tensor products of the 
basis for V. An arbitrary a vector \x) £ V can thus be identified with the variables 
of the polynomial as follows 

n 

k) :=E^^-|-?') ■ (66) 

For a Hermitian polynomial T{x,x*) of symmetric degree d = 2a our goal will be 
to find a unique matrix Zt S End(T^'^") such that for all x 

((x|«")ZT(|x)«") = r(x,x*) . (67) 

In what follows, we will also call Zt a polynomial matrix, if Zt represents a poly- 
nomial T{x,x*) in the sense of (j67|l . A very strong motivation for this definition 
is that unitary transformations of the variables \x) U\x) result in unitary trans- 
formations of the polynomial matrix Zt — > W'^'^ZtU'^"' and this group action 
corresponds exactly to the original group action we defined on T{x,x*) such that 
Zl,[t] = g^^'^ZTg®'' for each g € U(7i). 

Let us now make a first attempt at finding Zt- One straightforward construction 
of a polynomial matrix is as follows 

Z't-.^ J2 «j>i>(«jl. (68) 

|i| = |j|=a 

where \vi) ~ |l)^*i «) 12)®*^ . . . It is easily verified that ^ holds for Py 

given above, which acts on the full space 1/®°. The condition on guarantees that 
Z^ is Hermitian and will therefore have real eigenvalues. 

Example 3.1. To get some intuition about polynomial matrices, consider the fol- 
lowing simple example. Let T(xi, X2, x^, x^) = x\x2X3Xii -\- x'^x\xiX2. Note that T 
is Hermitian and a sum of monomials of symmetric degree d ^ A. To see how we 
can write T as a matrix consider Z'rp e End(y^^) where V ~ 'C^ 

= |1,2)(3,4| + |3,4)(1,2| . (69) 

Clearly Zip is Hermitian and ((a;|'^^)Z^(|a;)®^) = T(x) as promised. 
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However, our first attempt at finding a mapping to matrices is somewhat unsat- 
isfactory, both because the ordering of the factors of V is arbitrary and because the 
polynomials are embedded into a vector space of dimension n^" (dim(End(F'*"))) 
which is much larger than vector space of Hermitian polynomials which has dimen- 
sion ("^"7^) • In the following wc will define a polynomial matrix based on Zt 
that resolves these issues. In particular, we will now see that the "relevant" part 
of Z'rp can be chosen to be unique. 

Recall from Section that jx)®" has support only on the symmetric subspace. 
This implies that 

{xl'^^iUaZ'Mlx)'^'' = (xp^Z^la;)®" . (70) 

If we define the polynomial matrix 

Zt := TlaZ^Ua (71) 

it still satisfies (|67p and is clearly also Hermitian. From ([25)1 it is easy to see that 
we can write HZ^H in terms of the number state basis for the symmetric subspace 
as 

Zt^ J2 CiCji!j!aj|i)(j|= J2 ^"J|i)<jl- (72) 

|i| = |j|=a |i| = |j|=a 

Since the number state basis is orthonormal, it is clear that this mapping is now one- 
to-one from the Hermitian polynomials to the Hermitian matrices on the symmetric 
subspace of V'^"'. 

Example 3.2. Let us now revisit Examvle \ 3.1\ and show how T can he expressed 
as a matrix Zt on the symmetric subspace. First of all, note that the element xiX2 
corresponds to i = (1,1,0,0) and X3X4 to i = (0,0,1,1). In both cases i! = 1!1!. 
Hence from (|72p we have 

Zt = ^(|(1,1,0,0))((0,0,1,1)| + |(0,0,1,1))((1,1,0,0)|) . (73) 

Note that this mapping is different from one that is often used in the litera- 
ture |33] in which T is mapped to an operator on a vector space with its basis 
vectors identified directly with monomials, in our notation 

Zt= J2 (74) 

|i| = |j|=a 

Note that this definition does not meet our requirement (|67p which will prove to 
be essential to our analysis in terms of a de Finetti theorem. 

3.1.2. Operations on polynomials. It is instructive to sec that many natural op- 
erations on polynomials have an analogue in terms of matrix operations for our 
definition of polynomial matrices. 

We first consider the extension of the polynomial to a polynomial of higher 
degree. More specifically, consider the operation T{x) — > T'{x) = T{x)r'^{x) which 
maps an order d = 2a polynomial to order d + 2. If we define the unit vector in 
the obvious way then from (j40p we have 

T'{x)= ^ J24x*'^'"'x'^'"' , (75) 

|i| = |j|=a k 

and so we find 

Zt'= E 5:^^^^i^^|5±ltl!«i|i + eO(j + ek|=H(Pv'®/„)n. (76) 

|i| = |j|=a k 
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The first expression comes from ([7^ , wliile tlie second is clear from the observation 
that {x\In\x) = X^fc^fe^fc the definition of Z'j, and Zt ([7T|) . 

Clearly this formula extends such that if T'(.t) ~ T{x)r^^^~°-\x) then we have 
Zt' = IVi{Zt ® /^'•^~"'')nf . A useful identity is the following, consider a matrix 
G End(Sym(F'*^)) and its partial trace Ma = tr^a{Mf) then we have 

U{MiZt') = tr{Ah{ZT ® = tr{MaZT). (77) 

The first equality comes because liiMglit — Mi and the cyclic property of the 
trace, while the second is a result of the definition of partial trace. 

Another natural operation we can perform on a polynomial is to act with the 
Laplacian which maps a Hermitian polynomial of degree 2a to a Hermitian poly- 
nomial of degree 2a — 2. We find 

AT(x)= y y zufeajx*'-'''^^^--'' (78) 



|i| = |j|=a k 



E {T.^^k + mk + i)a\+Ax*-^^ 

|i| = |j|=a-l \ k / 



(79) 



On the other hand we can take the partial trace of T using (|32p to find 

/TTTT 



tri(ZT) = iE E v^^«i|i-et)(j-et| (80) 



a 



= ^ E ( E(^* + 1)0* + i)"Ke: ) 7^ii)(ji ■ (81) 

|i| = |j|=a-l \ t I ^ 

and by comparison with ([7^ we see that a}\.X\(ZT) is the matrix that represents 
the polynomial AT'(x), and so up to a scale factor we can interpret the partial trace 
as the Laplacian. 

3.1.3. Sum of squares decompositions. The main purpose of mapping polynomials 
to matrices in this fashion is to establish the existence or otherwise of a sum of 
squares decomposition in which a polynomial of symmetric degree d is written as 
a sum over squares of degree d/2 polynomials. 

Just as we established a natural mapping of monomials of symmetric degree d to 
matrices on End(Sym(T^®'^/^)) it is possible to establish a mapping of monomials 
x' onto vectors in Sym(T^'^'^/^). Our goal will be to find a unique mapping from 
complex polynomials of degree (0, a) 

T{x) = E "i^;* , (82) 

|i|=a 

to a vector, where we want that 

T{x) = (Zt|2:)®" . (83) 
Proceeding as before we first consider the following trial vector 

\Z'^) E "il^i) ' (84) 

|i|=a 

where as before \vi) = \ i)^^i ^ |2)®*2 _ ^ _ Using the same argument as before 

|a;)®° lives on the symmetric subspacc Sym(V^'^'') and we project the vector {Z^\ 
onto that subspace to define 



|i|=a |i| = 
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where the second expression uses (i|uk) = Cii!(5i^k- Clearly we can also associate 
the complex conjugated polynomial T*{x,x*) with the vector \Zt)- Again the 
orthogonality of the number state basis implies that this mapping is unique. As a 
result we may map any vector \Z) e Sym(y^") uniquely to a polynomial in Tz of 
the form 

Now suppose that polynomial T{x, x*) is such that T'{x, x*) = T{x, x*)r'^^ maps 
to Zt' > 0. Wc may write 

Zt' =Y.^^\^^)^^^\ (86) 

i 

where each eigenvalue > and the eigenvectors \Zi) e Sym(T^®(''+''')). As a 
result there exist Ti{x) such that 

T'(x, X*) = Tix, x*y = J2 Kmx)\^ (87) 

i 

the right hand side is known as a sum of squares decomposition for T'{x^ x*). The 
existence of this sum of square decomposition implies that T{x,x*) > for all x. 
The set of polynomials T{x,x*) of symmetric order d ~ 2a such that T'{x,x*) = 
r(a;,a;*)r2(^"'') has a sum of square decomposition will be labelled Sc{d,£). 

3.2. Matrices for polynomials in real variables. Let us now consider the case 
of real variables. Our goal as before will be to find a unique mapping from real 
polynomials T{x) of even degree d = 2a to matrices Zt such that {xl®"" Zt\x)'^°' = 
T{x) 0. Naturally, we could proceed just as in the complex case. However, whereas 
our mapping was indeed unique for the case of Hermitian polynomials, it no longer 
is for the case of real valued variables. To see this, is it useful to think of the matrix 
Zt as a vector, where we want that 



T{x) = {xf^\ZT) . 

Note that when proceeding as in the complex case above \Zt) is of the form 

\Zt)=Y.PI |i) |j) , (89) 



for some coefficients /?■. That is, the first F**" systems correspond to the conjugate 
of variables and the second V®"" to non-conjugated ones. Clearly, in the real case 
there is no such separation between conjugated and non-conjugated variables and 
there are many ways to subdivide them into two components to form a matrix. 
Hence, the mapping is no longer unique. 



3.2.1. Construction of matrices. However, it becomes clear when looking at 
that our problem now has an additional symmetry that again allows us to restrict 
to a certain subspace on which polynomials are represented uniquely as matrices. 
In analogy to our discussion of complex polynomials of order (0, a) in the previous 
section we can uniquely map our real polynomials to vectors in Sym(y*'') as follows 

(90) 

We may map this uniquely to a matrix acting on Sym(l/®'') in the usual way but 
the corresponding matrix Zt lives in the maximally symmetric subspace, i.e., Zt G 
MSyni ((R")®"), as opposed to the merely symmetric subspace in the complex case. 
As we noted earlier all matrices in MSym ((M")'*'^) are symmetric and therefore have 




^Recall from Section 12.3.11 that the optimization of any homogeneous polynomial can be per- 
formed by optimization of a homogeneous polynomial of even degree. 
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real eigenvalues. This fact is important later for sum of squares decompositions. In 
analogy to the complex case orthogonal transformations of the variables \x) — > 0\x) 
result in orthogonal transformations of the polynomial matrix Zt Qr^®"^ ZtO®'^ 
and this group action corresponds exactly to the original group action we defined 
on T{x) such that Zi^^yT] = g^^^ZTg®" for each g G 0{n). 

3.2.2. Operations on polynomials. It is again instructive to consider how natural 
operations on polynomials map to operations on matrices, or their vectorized forms. 

Firstly we will define an operation on the vector \Zt} corresponding to T{x) 
to produce the matrix corresponding to T'{x) = T{x)r'^{x). In order to do this 
we define the vector |r^) = ^ \ii) which corresponds to r^{x) = (a:|®^|r^). Then 
following the mappings discussed above we define \Z'j„) = \Zt} ® \r^) for which 
it is clear that (a;|®*^'^+^^ and so as before a projection onto the symmetric 
subspace preserves this property and we define \Zt') = Ild{\ZT) <8) l^*)). Notice 
that the matrix corresponding to |r^) is I„ and so we could also use the notation 
\Id) instead. However in contrast to the complex case Zt' 7^ HaiZr <8 In)IIa as is 
clear since the resulting matrix is not maximally symmetric. 

As in the complex case this formula extends such that if T'{x) — T{x)r'^'^^~°'\x) 
then we have \Zt') = Il2i{\ZT) ® |I„)'^(^"")). A useful identity is the following, 
consider a matrix Me € MSym ((M")®^) and its partial trace Ma = tr^aiMi) then 
we have 



triMiZT') = {Me\ZT') = {Mi\i\ZT) ® |I„>®('^'')) = tr(A/,(ZT ® C^^"'^))) 



The first equality is the definition of these matrices in terms of vectors. The second 
identity holds since |M^) G Sym(y*^^). The third equality converts the vector 
inner product back into an inner product of the corresponding matrices, and the 
final equality is the definition of the partial trace. 

We can also develop operations that correspond to the Laplacian acting on the 
Zt- In order to do this, let us define the following operation from V'^'^ to \/'8"i-2 



then define the following operation which commutes with all permutations 



It should be noted that this operation has a connection with representations of the 
orthogonal group [41], but we will not need this here. The effect of 9i on our basis 
for V'®'^ is straightforward to compute 



triMaZr). 



(91) 





(93) 




(94) 



k 



As a result we find the following 




(95) 



and this is clearly the vector \AZt) corresponding to AT{x) on V®'^. Since 9i and 
lie; commute we find that 

0i\Zt) = \AZt) ■ (96) 
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We can now understand the partial trace on Zt as follows. For any matrix 
M e End((C")^") we have 

n 

tri(M) = ^(la-i ® (z|)M(Ia_i ® . (97) 
1=1 

Associating M with a vector \M) = J2ij Mij\i)\j) e (C")«'2a ^^iat the partial 

trace is equivalent to computing 

n 

tri(M)=^(I®(z|)(I®(z|)|M)=0(,,2a)|Af) . (98) 
Note that when applied to a number state which is permutation invariant 

rf(rf-l)e(a,2a)|i) =ei|i) • (99) 

Hence, for any \M) of the form = J2iai\i) wc have d{d - l)tri(M) = 6*1171/) 
and thus 

Zat = l)tri(ZT) . (100) 

so that partial trace once again corresponds to the Laplacian. 

3.2.3. Sum of squares decomposition. In this section we wish to develop a semidef- 
inite programming method to find sum of squares decompositions for T[x). In the 
previous section we established a unique mapping of a real homogenous polynomial 
of order d to a unique matrix Zt G MSym ((M")«"^). However MSym ((R")'*'^) 
is not closed under multiplication and so some eigenvalues of Zt could be nega- 
tive. Yet, there could be some symmetric (but not maximally symmetric) matrix 
Zt G End(Sym(y®°)) such that {xf" Zt\x)'^'' = T{x) and Zt > 0. This would 
imply the existence of a sum of squares decomposition for T as wc will see below. 
The required property is that Zt = Zt + Z for some \Z) such that Hc;|^) = so 
that {x\'»'^\Z) and we have {x\'^'^\Zt) = T{x). 

Not all positive polynomials T{x) can be written as sums of squares but as in the 
complex case we can find procedures to check whether or not T'{x) = T{x)r'^^^'^ 
can be written as a sum of squares. We will say that T{x) £ S{d, £) when T is a real 
polynomial of degree d if there exists some Z G End(Sym(F'^^)) having Il2e\Z) = 
such that 

Zt' +Z> 0. (101) 

If such a Z exists we have 

Zt' + Z = J2^^\Zt,){ZtA (102) 

i 

where each eigenvalue Ai > and the eigenvectors {Zt^) G Sym(y^''). As a result 
there exist Ti{x) such that 

T'{x) = Tixy-^ix) = ^^iT,ix)f (103) 

i 

the right hand side is known as a sum of squares decomposition for T'{x). The 
existence of this sum of square decomposition implies that T{x) > for all x. 
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3.2.4. Decomposing polynomial matrices using spherical harmonics. Above, we de- 
termined that our polynomial matrices can be represented on the maximally sym- 
metric subspace MSym ((R")**"). As such, it is not hard to see that there exists 
another way to find the corresponding matrices explicitly, if we already know the 
decomposition of the polynomial into its Fourier components 



T{x) 



2r N{j,n) 
j—0 ?n — 1 



ti 



(X) . 



(104) 



Let us first consider how to express Sjm{x) itself as a matrix Msj^. Note that by 
the Funk-Hcckc formula (sec also Lemma 



Sn-l 



{{x\z)r 



{z)dz 



we have 



Xin,r,j)sjmix) 



(105) 



where ujn is the surface area of the sphere S"" ^ and \{n,r,j) is a normalizaton 
defined in Lemma lA. II Letting 

Af.,„ := (106) 

is thus the corresponding maximally symmetric matrix. By linearity we can thus 
write the polynomial matrix as 

Zt ^Y^^j^^^s,^ ■ (107) 

4. Moments and moment matrices 

4.1. Moment matrices for complex variables. Once again, we consider the 
case of complex variables first, where much is known from quantum optics and 
quantum information in which complex vector spaces are the most relevant ones. 

Let us briefly recall the concepts of moment sequences and moment matrices in 
optimization (see [55] for an excellent survey) and translate them to our notation 
and symmetrized form. Let /i be a measure on CP"~^. Given a monomial x*^os^ 
with |i| = |j| = a as described above we can define a moment of symmetric order 
2a as follows 



X*' dfi{x) 



(108) 



We define the moments in terms of components of the normalised vector x = a;/||a;||2 
so that the integrand is a proper function on CP"^^. A question of great concern 
is to determine whether some arbitrary (truncated) sequences y = {yi.jjij can be 
written in the form of ()108[) for some measure dfi. Such a measure is also known as 
a representing measure. One of the objectives of Lasserre's hierarchy is to provide 
us with a procedure that we can use to determine whether y admits a representing 
measure and in general this problem can be partially addressed by forming suitable 
matrices of moments and testing for positivity of the matrix. 

Let Zi j denote the polynomial matrix for x*^x^ as defined in the previous section. 
Then the corresponding moment can be recovered from this matrix as 



/ 

JCP" 



(x|*^°Zij|x)®°dAi(a;) =tr 



\x){xrdf,{x) 



= yi,3 



(109) 



= :M„(a<) 

Here we see that the matrix Ma{fi) € End(Sym(y*°)) contains information about 
all the moments of symmetric order 2a and is known as a truncated moment ma- 
trix |28| . The particular arrangement of moments in this matrix differs from that 
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of many truncated moment matrices that can be found in the hterature but it is 
chosen to correspond to our chosen mapping of polynomials to matrices and has 
many useful properties. For example, as always in such constructions, Ma{n) > 0. 
Also normalization of the measure requires that tr(Ma(/J.)) = 1. For all a' < a we 
have Ala'ifJ.) — tr^a' (^^a(Ai))- Finally as a result of this definition we can evaluate 
the average of a polynomial of symmetric degree 2a over the measure dfi(x) as 
follows 

tr{Mai^L)ZT) = / {x\^^ZT\x)'^-dij{x) 

JCP"-i 

= [ T{x)dfi{x) . (110) 

JCP"-i 

This justifies our choice of definition of the truncated moment matrices Ma- This 
average value can also be inferred from the truncated moment matrix M{{fi) for all 
£ > a. Let us define T'(x) = T{x)r{x)^^^~°'\ and the corresponding matrix Zt' ■ 
We then have by (|77)) 

triMt{fi)ZT')=tr{MaitJ.)ZT)^ f T{x)dfiix) . (Ill) 

JCP"-i 

The set of valid truncated moment matrices can be defined as follows 

Mc{a) = {M £ End(Sym(y®°))|3 dn{x) : M = / |x)(x|^"d//(x)} . (112) 

JCP"-i 

Testing whether a representing measure exists for a matrix M G End(Sym(y^°)) 
so that AI £ Aic{o.) is a difficult task in general. That is, it is difficult to decide 
membership of A4c{o-)- However, there are some straightforward necessary condi- 
tions for membership of Mcio.) which we can test. As noted above we must have 
AI > and tr(M) = 1, i.e., M is a state. Furthermore, M is extendible in the sense 
that for all £ > a there exists a matrix Ade € End(Sym(F®^)) such that AIi > 0, 
tr(M^) = 1 and tr^_a(M^) = M. For a given AI the question of whether such an 
Alf exists is an instance of a semidefinite programming feasibility problem and is 
computationally tractable [4]. This leads us to define the following set 

Mc{a,£) ={AI e End(Sym(V^^"))|3Mf e End(Sym(T/®^) : Ale > 0, 

tr(A/,) = 1, tr,_a(M,) = AI} (113) 

and we have the containment A4c'{o-) C 7Mc(i,^) V^. 

4.2. Moment matrices for real variables. Let us now turn to the case of real 
variables. As it turns out the main ideas are essentially analogous. In analogy to 
the above, one defines a moment of order i with as iji ~ J x^dfj.{x), where is a 
monomial determined by i as outlined above [28| and dfi(x) is a normalised measure 
over the hypcrsphere S*""^. As before we are interested in determining whether 
a given sequence of numbers {yi}i can be true moments for some representing 
measure. Once again necessary conditions for this can be determined that are 
related to matrix positivity. 

Let Zi denote the (infinite) polynomial matrix for as defined in the previous 
section. Note that Zi G MSym ((K")**"), that is, Zi is a maximally symmetric 
matrix. We can now recover the moment yi using this matrix as 



xl'^'" \Zi) dfi{x) ^tr 



\x){x\'^''dti{x) Zi 



(114) 
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Here we see that the matrix Ma{fJ-) contains information about all the moments of 
order d = 2a and is known as a truncated moment matrix |28| . As in the complex 
case we have Main) > 0, tr(M) = 1 due to normalization of the measure and for 
all a' < a we have Ma'ifJ.) = tr^o'(A^o(M))- Notice, however, that since now x € 
we have € MSym((M")®") and thus also Ma(/i) S MSym ((M")^^'^). 

Finally as a result of this definition we can evaluate the average of a polynomial of 
degree 2a over the measure dfi{x) as in the complex case 

tr{Ma{fi)ZT) = / {x\^''ZT\x)^-dfi{x) 

= / T{x)dfi{x) . (115) 

As before this average value can be inferred from the truncated moment matrix 
Miifj.) for all £ > a. As before, let us define T'{x) = T{x)r{x)'^^'^'''\ and let Zt' 
denote its polynomial matrix. We then have by (|91]) 

tr{Ah{n)ZT') =tr{Main)ZT) ^ [ T{x)d^i{x) . (116) 

The set of valid truncated moment matrices can be defined as follows 

M{a) = {M e MSym ((R")^'^) |3 d^i{x) : A/ = / \x) {x\'^^'' d^l{x)} . (117) 

Testing whether a representing measure exists for a matrix M S MSym ((R")*^") 
so that M £ A4{a) remains a difficult task. However, as in the complex case 
there are several straightforward necessary conditions for membership of A^(a). As 
noted above we must have M > and tr(M) = 1. We also again want that M 
is extendible such that for any £ > a there exists a matrix Mi S MSym ((R")*^^) 
satisfying > 0, tr{Mg) = 1 and tri-a{Mi) = M . For a given M the question of 
whether such an Mi exists is once again an instance of a semidefinite programming 
feasibility problem. That is, membership of the following set 

M{a,£) = {M G MSym ((R")^'') \3Mi G MSym ((R")®^) : > 0, 

tr{Mi) = 1, tr,_,(M,) - M} (118) 

can be decided using a semidefinite program. Clearly, wc again have the contain- 
ment M{a) c M{aJ) V£. 

5. P AND Q-REPRESENTATIONS 

In our proof of convergence, we make use of two well known concepts orig- 
inating in quantum optics [37] • These are the so-called Glauber-Sudarshan P- 
representation and Husimi Q-representation of a matrix. We will see that they 
have a close connection to what are known as moment matrices in optimization in 
our symmetrized form of the SDP hierachy. 

5.1. On complex projective spaces. 

5.1.1. P -representation. Let us now consider an arbitrary matrix M € End(Sym((C") 
with M > 0. It is well-known from quantum optics (see e.g. |37] or [8] for the form 
used here) that any such M can be written as 

M= f PMix)\x){x\'^^dx , (119) 

for some coefficients given by the function Pm G L^(CP"^^), i.e. the space of 
square-integrable functions on the complex projective space CP"^^ (see [5] for a 
quantum information treatment). This (jllOp is known as the P -representation of 
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M. If there exists a Pm such that for all x we have Pm{x) > we also say that M 
admits a positive P -representation. Note that for tr(A/) = 1 

l = tr(il/)=/ PAf{x){{x\x)Ydx^ I PM{x)dx . (120) 

JCP"-i JCP"-1 

As such, we may think of the numbers PM{x)dx as a quasi-probability distribu- 
tion/measure, and indeed one exists for any such M . The question of whether the 
sequence of moments corresponding to M admits a representing measure is thus 
equivalent to asking whether M admits a positive P-representation. As such, our 
symmetrized way of writing polynomials makes it clear that there is a close relation 
between the question asked in optimization of whether a representing measure can 
be found, and the question that is of some concern in quantum optics of whether 
M admits a positive P-representation. 

5.1.2. Q -representation. Closely related to the P-representation is the so-called Q- 
rcprcsentation of M E Sym((C")®^), defined by the value of 

Qm{x) := {xf'M\x)®' (121) 

for all \x) € C". In the language of the previous section, we are now simply inverting 
the mapping of Hermitian polynomials to Hermitian matrices and thinking of M 
as a polynomial matrix for the polynomial Qm- As we know this mapping is one- 
to-one and invertible and so M is determined completely by the values of Qm- It 
is also useful to note that for M > 0, we have Qm{x) > for all x so that while a 
normalised P-function even for a positive M does not have an interpretation as a 
measure we may construct valid measures using Q-functions and positive matrices. 
For the complex case, it is known that there exists a way to convert a matrix' 
P-representation into its Q-representation and vice versa for so-called coherent 
states [37| or finite dimensional systems [8]. 

5.1.3. Evaluation on a polynomial matrix. Before turning to the real case, it will 
be instructive to understand triMZx) where M > with M e Sym((C")'*'^) and 
Zt is the polynomial matrix of some polynomial T{x). Because such an M always 
admits a P-representation we have 

tr(MZT) = / PM{x)(x\'^^ZT\x)'^^dx 

= / PM{x)T{x)dx . (122) 

That is, we can always understand this trace as the evaluation of the polynomial 
T at points x weighted by coefhcients Pm{x). As these arc not necessarily posi- 
tive, P]\i(x)dx is not necessarily a measure. This observation will lead to a nice 
interpretation of solutions of intermediate (non-optimal) levels of the SDP hierar- 
chy discussed later. However, for any M with tr(A/) = 1 that admits a positive 
P-representation we do of course have such a representing measure which will be 
the case if the final (optimal) level of the SDP hierarchy is reached. 

5.2. On the hypersphere. Let us now turn to the case of real variables. As it 
turns out the main ideas are essentially analogous, however, we need to prove the 
existence of a similar P-representation. For any M € MSym ((M")*^^) we would 
like there to exist a function Pm{x) on S*""^ such that 

A/ = / PMix)\x){xf^'dx . (123) 

We will use standard arguments to establish this. We start by noting that all 
M e MSym ((M")®^) do possess Q-representations. 
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5.2.1. Q -representation. Just like in the complex case, we can again consider a 
Q-representation for any M £ MSym ((K")**^), defined by the value of 

QMix) := {x\^'M\x)^' (124) 

for all X e 5"^"'^. As before, we are now again simply thinking of M itself as a 
polynomial matrix for the polynomial Qm and the mapping we described in the 
previous section was one-to-one and invertible, only zero matrices map to zero 
polynomials and vice versa. Note that any maximally symmetric matrix M G 
MSym ((R")®^) corresponds to a polynomial of degree at most 2i and hence M is 
fully determined by (|124p evaluated at 2^ -I- 1 distinct points. Clearly, for M > 0, 
we have Qm(x) > for all x. 

5.2.2. Existence of a P -representation. The following lemma shows that any max- 
imally symmetric matrix does indeed admit a P-representation. The proof of this 
is analogous to the derivation in quantum optics |37j (see also [S] for a quantum 
information treatment). 

Lemma 5.1. Any maximally symmetric matrix M G MSym((M")®^) admits a 
P -representation 

M ^ [ PMix)\x){xf^dx , (125) 
for some function Pm(x) on the sphere S"~^ . 

Proof. Note that the space of maximally symmetric operators that admits a P- 
representation is a linear subspace of MSym((M")^^) defined as 

Sp |y G MSym((R")®0 I 3Py s.t. J Py {x)\x) {x\®^ dxj . (126) 

Let Sj^ := {W G MSym((R")»^) | W G Sp, tr{V'^W) = 0} denote its complement, 
i.e. the maximally symmetric matrices that do not admit a P-representation. Our 
goal will be to show that the only maximally symmetric matrix M G Sp is M = 0. 
By defintion, we have that for any M G Sp and any V E Sp 



tr (V' M) = tr (VM) = / Pv{x){x\^' Mlx^'dx (127) 
Pv{x)QM{x)dx = , (128) 



where Qm{x) = (xp^M|x)^^ is the Q-representation of M . Note that since Pv{x) 
is an arbitrary square integrable function on the hypersphere, (jl27p implies that 
Qm{x) = for all x G S'"~^. Since Qm{x) is a polynomial of degree 2i we con- 
clude that it is the zero polynomial. Recall that there is a one-to-one mapping 
from polynomials to maximally symmetric matrices and the matrix M for the zero 
polynomial is Af = 0. □ 

As before, we speak of a positive P-representation if Pm{x) > for all x. Note 
that for tr(Af) = 1 we have 



l=tr(Af)=/ PM{x){{x\x)ydx = PM{x)dx . (129) 

As such, we may again think of the numbers PM{x)dx as a quasi-measure, and 
indeed one exists for any such M . The question whether the sequence of moments 
corresponding to M admits a representing measure is thus equivalent to asking 
whether M admits a positive P-representation. 
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5.2.3. Evaluation on a polynomial matrix. The following observation again leads 
to a nice interpretation of the solutions at intermediate (non-optimal) levels of the 
SDP hierarchy. More precisely, using the P-representation we can write tr(AfZT) 
where M > with M g Sym((C")®^) and Zt is the polynomial matrix of some 
polynomial T(x) as 

triMZr) = I Pm{x){x\®^ Zt\x)®Ux 



PM{x)T{x)dx . (130) 



S" 



That is, we can always understand this trace as the evaluation of the polynomial 
T at points x weighted by coefficients Pm{x). Again, as these are not necessarily 
positive at an intermediate level, PM{x)dx is not a measure. 

5.2.4. Relation between the P and Q -representation. Wc arc now ready to work out 
the relation between the P and Q-rcprcscntations of a maximally symmetric matrix 
M . From the definition of the Q-rcprcscntation we sec that they are related as 



Qm{x) = {xrM\xr' = {xr yj^^ ^ PMivMivrdyj Ix^' (Ul) 

{{x\y)f'PM{y)dy. (132) 

That is, the Q-representation can be written as a convolution of the P-representation. 
Note that this operation can be seen to commute with the action of the orthogonal 
group on functions on the hypersphere. Furthermore, recall from Section 12.41 we 
can decompose Pm = J^jmPfL^jmix) and Qm = J^jmljLsjmix) where pj^ and 
Qjf^ are the Fourier coefficients of Pm and Qm respectively. Recall also that since 
Qm is a polynomial of order 2£ only j ~ 0,2, ...,2£ occur in the sum over j. In 
essence, the following lemma is an easy consequence of the Funk-Hecke formula ([55)) 
as employed in Lemma 12.31 

Lemma 5.2. Let M G MSym((R")®^) be a maximally symmetrix matrix with P- 
representation M = J2jmPjm^jm '^'^'^ Q-representation Qm{x) = (.xp^Mja;)'^^ = 
Y^jmlfL^j^ix)- Then 

qfL = ^KnJ,j)pjL, (133) 
where X{n,£, j) is defined in Lemma \A.l\ 

Proof. As the spherical harmonics are orthogonal, we can recover the Fourier coef- 
ficients of Qm{x) as 

tr (S'j„,A/) = / Sj„^{x){x\®^M\x)®^dx (134) 

jm{x)Q M {x)dx qj'^jn' Sj,rn'{x)Sjm{x)dx (135) 

= ^ , (136) 

where we have again used the fact that the trace is linear and cyclic. Writing M 
in terms of its P-representation and using Lemma 12.31 we thus have 

= c.„ tr {S'^,^M) = ^ pf,^, tr (5|„,4„,,) = A(n, j) pf„, , (137) 

j'm' 

which is our claim. □ 
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The central idea of our main result will be that for j <^ £ we have that pj^ is 
(up to normalization) almost equal to g*,^. The following lemma makes it clear, 
which normalization we want to choose to relate the P and Q representations for 
any state M. This is done by comparing the coefficients and ^q-'^. 

Lemma 5.3. Let M £ MSym((M")'^'') be a maximally symmetric matrix satis- 
fying M > and tr(M) = 1 with P -representation M ~ J^jmPfm^jm '^'^'^ Q~ 
representation Qm{x) ~ {x\®^M\x)®^ = 12jm1jrn^jm{^)- Then the matrix 

= vrnr^ I QM{x)\x){xrdx (iss) 

a;„_iA(n,^,0) J g^-i 
is satisfies M > and tr{M) = 1. 

Proof Since M > 0, we have Qm{x) > for all x £ 5""-^ and hence M > 0. It 
remains to prove that tr(M) = 1. Using the Q-represention of Af and Lemma [2.41 



we can write the trace of the integral in (|138[) as 

IE'?'™ / s,M\x){xrdx] =E9i™tr(5|„0 = ^ . (139) 

Similarly, using the P-representation of M and Lemma YTM 

l = tr(M) = -^. (140) 



Using the relations between the P and the Q-representation of a maximally sym- 
metric matrix fLemma 15.21) we thus have that 

tr (m\ = ^ = 1 . (141) 



□ 



6. The existence of approximate representing measures 

Our key result is a theorem on the existence of approximate representing mea- 
sures for a set of possible moments given by a maximally symmetric matrix M S 
A^(fc, €). Specifically we will show that if M S A^(fc, t) then M can be approximated 
by a maximally symmetric matrix M that has a normalised positive P-function with 
an error that scales like 

Our main theorem was inspired by a set of results in quantum information that 
go by the name of finite quantum de Finetti theorems [7J [24l |35j [23l [25] . In its 
original form [TU], de Finetti's theorem states that any exchangeable probability 
distribution over an infinite sequence of random variables X\ , Xi , . . . is equal to 
a convex sum of product distribution^. Diaconis and Freedman [T3] later showed 
that a similar statement still holds approximately if the distribution is not infinitely, 
but merely ^-exchangeable. In particular, they showed that for any exchangeable 
probability distribution Pxi,...,Xe of £ random variables, the distribution on the 
first Xi, . . . , Xa is closely approximated by a convex sum of product distributions. 
More precisely, there exists some measure fi on the set of distributions such that 
for all xi, . . . , 

PXu...,xAxi,---,Xa)^ j Px{xi)...Px{Xa)dll{Px) . (142) 



A probability distribution is exchangeable if for all permutations tt and all xi,X2,--- the 
probability of a sequence and its permutation is the same, i.e., P{xi, X2, ■ ■ .) = ^'(a:^7r(i)i ^7r(2)i • • ■)• 
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for some approximation parameter depending on a and i and closeness is mea- 
sured in terms of the statistical distance. As quantum states can be understood as 
a generalization of classical probability distributions, De Finetti theorems for infi- 
nite [22j[34l|6l[l7] as weh as finitely exchangeable quantum states [51 171 [2411351 [231125] 
have attracted much attention in quantum information. 

6.1. For the case of complex variables. 

6.1.1. Statement and consequences. We first state a finite de Finetti theorem for 
the complex case that is already known from quantum information. We note that 
it is in fact possible to prove a complex de Finetti theorem in a rather different way 
along the lines of our real de Finetti below, but as this is likely only of interest to 
quantum information theorists we omit an alternate proof |15j . Instead, we simply 
state the following result proven in [7], translated into a language that provides 
some insight into the problem at hand. 

Theorem 6.1 (Unitary finite deFinetti Theorem [7]). Let n,a,i £ N and a < £. 
Let M G Sym ((C")**^) be a symmetric matrix that is a state (i.e., tr{M) = 1 and 
M >0), and let Qm{x) = {x\'^^ M\x)®^ he its Q -representation. Define 

Ma ■■= / Q m{^)\x) {x\^'' dx . (143) 



dim(Sym((C")®^)) 

Then the reduced matrix Ma '.= tr^aiM) is approximated by the matrix Ma as 



M„, - M„ 



^ < ^ . (144) 



Furthermore, Ma is a state (i.e.,tr{Ma) > 0, tr{ALa) ~ i), and has a positive 
P -representation. 



A useful consequence of the de Finetti theorem is that by ([2T|) for any matrices 
Zt with 1 1 Zt 1 1 oo < 1 the difference between t r ( Zt Mi ) and t r ( Zt M2 ) is no more than 
e. In particular, this means that if Zt is a polynomial matrix and thus U{ZTMa) 
is the evaluation of the polynomial T at points x weighted by the cofRcients of its 
P-represcntation PM^{x)dx (see Section [5. 1.3L then the evaluation of T at points 
weighted by dm{x) = QM{x)dx/ dim(Sym((C")®^)) takes on almost the same value. 
Note that whereas P]\i^{x)dx is not a measure, dm{x) actually is one. 

6.2. For the case of real variables. Wc now proceed to prove a de Finetti the- 
orem for real, maximally symmetric matrices. It should be noted that a de Finetti 
theorem for arbitrary real matrices in the symmetric subspace cannot hold [S]. 
In fact, this even lead to some speculation that de Finetti theorems may not be 
useful to study convergence of hierarchies of semidefinite programs for polynomial 
optimization |2]. However, as we saw earlier we do not care about arbitrary real 
matrices, as the relevant space is the space of maximally symmetric matrices. In- 
deed, the counterexample given in [S] is real, but not maximally symmetric. Intu- 
itvcly, the reason why our proof fails for general real symmetric (but not maximally 
symmetric) matrices is that the proof of the existence of a real P-rcprcscntation 
fLemma l5.2p fails at the point where wc need that Qm{x) = implies that M = 0, 
since general symmetric matrices are not equivalent to unique polynomials over real 
valued variables. 

6.2.1. A de Finetti for polynomials. We first prove a de Finetti theorem where the 
approximation is in terms of a norm that relates directly to the optimization of 
polynomials on the sphere. More precisely, we will show approximation in terms of 
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the i^i-norni on the space of maxnTially symmetric matrices A G MSym ((R")**^) 
defined as 

\\A\\fi := sup triZpA) , (145) 

ll-P|U<i 

where the maximization is taken over homogeneous polynomials F with polynomial 
matrix Zp and ||i^||oo = inax3,g5„-i |f (a;)| is the p — > oo-norm for functions on the 
sphere. Claim IA.6I shows that this quantity is indeed a norm. 

Theorem 6.2. Let n,a,£ G N with n > 3, a < £ and £ > 2a^{a + f - 1) - f . 
Let M G MSym ((M")®^) he a maximally symmetric matrix that is a state (i.e., 
tr(A/) = 1 and M > 0), and let Qm{x) = {x\'^^M\x)'^^ be its Q -representation. 
Define 

:= f r(^ + i)r(£ + f) \ r g ( )|^^^ ^^^g^ 

Then the reduced matrix Ma '■— tr^a(M) is approximated by the matrix Ma as 

Ma - Ma 



4a2 (a + f - 1) , , 

< 2 L _ 147 

Fi- 2l + n ^ ' 



Furthermore, Ma is a state (i.e., tr{Ma) > 0, tr{AIa) = 1 arid has a positive 
P -representation. 

Proof. We first show that Ma is a state. Using the fact that A/ > and tr(M) = 1 
it is not hard to see fLemma l5.3p that the matrix 



= u^^Tm / QM{x)\x){xrdx , (148) 

where X{n,£, j) is defined as in Lemma [A. 1[ satisfies Af > and tr(Af) = 1. Hence 

Ma = tr(A7) J^^-— / QM{x)\x){xrdx (149) 

w„_iA(n,£,0) 

satisfies Ma > and tr {Ma) = 1. 

Let us now prove the claimed upper bound. In terms of the Fourier coefficients, 
we can write Ma in its P-representation as 



Ma = J Pj^j^{x)\x){xrdx , (150) 

where 

jm 

Similarly, since A^T is a maximally symmetric matrix, it admits a P-representation 
(Lemma [53J as 

M = [ ^ PM{x)\x){xrdx = Y^P'LSU ■ (152) 
Given its P-representation we can immediately write down the reduced matrix as 
Ma = [ ^ PMAx)\x){x\'^'^dx = Y^pfmSU ■ (153) 

Our goal is to show that the low order Fourier coefficients p^l^ stemming from 
the P-representation are very closely related to those of the Q-representation. For 
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- Ej™Pf™5^, let = E™Pf™5^- For any Zp e MSym((M»)«-) we can 



write 

tr{ZFiMa~Ma) 
2a 



(154) 



j^O m j'm' ^ 

EE^f^te- 

2a 

EE^f. 



a;„_iA(n,£, 0) 



Hj' jYl' 



w„_iA(n,^,0) 



E 

2a 

E 

J=0 



A(n,^,0) 
A(n,€,j) 

X{n,e,0) 



- 1 



w„_iA(ri,^, 0) 



- 1 tr{Z'pMa) 



(155) 
(156) 
(157) 
(158) 



where (|156p follows from the orthogonality of the spherical harmonic matrices SJ„ 
and Sj,^, for j ^ j' and jti ^ m' fLcmma 12. 3p . (jl57p from the relation between 
the P and the Q-representation fLemma l5.2[) . and the last equality again from the 
orghonality of the spherical harmonics. We can thus write the norm as 



I Ma -Ma\\Fl= sup tr(ZF {Ma " Ma) 

l|F|U<i 



2a 

^E 

3=0 



A(n,£,0) 



\Ma 



Fl 



(159) 
(160) 

where the first inequality follows from the fact that || • is a norm and the second 
from Corollary I A. 41 It remains to bound ||A/a||Fi = sup||^||^<]^ | tr{MaZF)\- Using 
the fact that for all x Qm{x) > and aj„/(a;„_iA(n, £, 0)) > we can bound 



< 



4a^ la+'4-l) 



\Ma 



Fl 



\tr{MaZF)\ 



< 



uJn-iX{n,i,0) Jg„-i 



< 



a;„_iA(n,^, 0) 



w„_iA(n,£, 0) 
tr(A7a) = 1 , 



QM{x)F{x)dx 
QM{x)\F{x)\dx 
QM{x)dx 



which yields our claim. 



(162) 
(163) 

(164) 

(165) 
□ 



6.2.2. A standard de Finetti. For our convergence results this is all we need. For 
completeness, however, we state a real de Finetti theorem in its usual form where the 
norm is not related the evaluation of polynomials but related directly to matrices. 
For our proof, let us first establish the following lemma. 

Lemma 6.3. Let n,a,£,j £ N with n > 3, a < £, and even j < 2a. Let M € 
MSym ((M")®^) he a maximally symmetric matrix that is a state (i.e., tr(A/) = 1 
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and M >0), and let Qm{x) = {x\®^ M\x)®^ be its Q -representation. Define 

( r(£ + i)r(^ + f)~ 



Mr, 



V7rw„_i r I 



}M{x)\x){xf''dx , 



(166) 



5"- 



and the reduced matrix Ma tr ^a{M). Then any for harmonic homogeneous poly- 
nomial F of degree j , i.e., for any maximally symmetric matrixZp G MSyni ((M")*^" 

of the form Zp = 'ErnPjmSjm' 



tr 



Zf Ma - Ma 



< '- HZpMa) 



(167) 



2i + n 

Furthermore, Ma is a state (i.e.,U(Ma) > 0, tr(Ma) — and has a positive 
P -representation. 

Proof. The fact that Ma forms a state has been shown in the proof of Theorcm l6.2l 
Recall also that Ma and Ma can be written in terms of the P representation of M. 
Since F is harmonic homogeneous of degree j its polynomial matrix Zp only has 
components for this j, i.e., its P-representation is of the form 

N(n.]) 
7 — r,^ 



We thus have 

tr (^ZpiMa - Ma) 

m j'7n' 



m ^ 

A(71,f,j) 

A(?i,£,0) 
^ _ Xin,i,j) 



c.„_iA(n,£,0)^^'" 



tr ( S"^ S"^ I 



= 1- 



E„F tr CS"" 9" "I 



UiZpMa) 



(168) 
(169) 

(170) 
(171) 
(172) 



where (|170p and p7ip follow from the orthogonality of the matrices S'J„ and Sj,^, 
for j j' and m ^ m' fLcmma 12. 3p . and (jl7ip from the relation between the P 



and the Q-representation fLcmma 15. 2p . It remains to bound p72p . Lemma [A. 3 1 
shows that for any even j 



1 



A(n,£,j) ^ j(^-l) 



(173) 
□ 



X{n,£,0) - 2£ + n 

which is our claim. 

We are now ready to prove a real de Finetti theorem for the matrix Ll-norm. 

Theorem 6.4. Let n,a,eeN with n>3 and a < £. Let M e MSym ((M")*"^) be 
a maximally symmetric matrix that is a state (i.e., tr(M) = 1 and M >0), and let 
Qm{x) = (x|®^M|cc)®^ be its Q- representation. Define 



M„ 



22^c^„ 



i)r(^ + t) 



QM{x)\x){xfdx 



5„-i 



Then the reduced matrix Ala tr^a(M) is approximated by the matrix Ma as 

2a2 (a + f - 1) 



(174) 



Ma - Ma 



< 



21 + n 



(175) 
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Furthermore, Ma is a state (i.e.,tr(Ma) > 0, tr{Ma) ~ i), and has a positive 
P -representation. 

Proof. Wc want to show that the distance between Ma and Ala is bounded. By the 
definition of the Ll-norm 

Ma -Ma = sup tr(ZFiMa-Ma)) , (176) 

1 \\Zf\\o.<1 ^ ' 

where the maximum is taken over all Zp e End(i?'^°). Recall from the proof of 
Lemma [5.2l that the space of maximally symmetric matrices forms a linear subspace. 
As such, we can write Zp = Zp + Z^ where Zp e MSym ((M")®") and tr{Zj^S) = 
for any maximally symmetric matrix S G MSym ((K")*^"). Furthermore, note that 
we can expand the maximally symmetric component as 

2a 

Zp = J2z'p, (177) 

where Zp is the polynomial matrix of a harmonic homogeneous polynomial of degree 
j. For each individual j we can bound 

sup tr (zj,(Ma - Ma)) < -^^f- '- sup tr{Z'pMa) (178) 

\\ \\ <- ^ ^ Zi -\- H II 7J II 1 

ll^pllocj:;-'- ll-'-^j^llac^-L 

j (1±IL - I) 



where the first inequality follows from Lemma 16.31 and the second from the fact 
that Ma > 0, tr(Ma) = 1 and hence tr(Z^Afa) < 1- Using that tr{Zj;Ma) = 
tr{ZpMa) = wc can thus bound 

sup tr (Zp{Ma - Ma)) = sup V tr (z'p{Ma - Ma)) (180) 

ll^i^lU<l ^ ^ \\Zf\\^<1 j ^ ' 



<Y^ sup \.x{z'p{Ma-Ma)) (181) 



,<1 



where the first inequality comes from the convexity of the norm, and the second 
from (jl79p . Our claim now follows by noting that the r.h.s is increasing in j and 
only even j with j < 2a appear in the sum. □ 



7. Hierarchy of semidefinite programming relaxations 

We are now finally ready to return to our problem of optimizing T{x) over the 
sphere. Recall from Section 12.3.11 that it is sufficient for us to understand how to 
optimize homogeneous polynomials T{x) of even degree d = 2a. This will be done 
using a hierarchy of scmi-dcfinitc programs of the kind introduced for polynomial 
optimization problems by Parrilo |33j and Laserre [26j . Wc will determine the 
performance of our hierarchy of optimizations at a given level of the hierarchy. We 
will just describe the case of the hypersphere, although the result for the complex 
projective space is straightforward. 
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7.1. Semideflnite programming relELxations of polynomial optimization. 

The key to the semidefinite programming relaxations of [IHl [33] is to note that one 
can formulate our optimization problem in terms of truncated moment matrices. 
More precisely, the problem of optimizing T(x) over the sphere can be cast as an 
equivalent problem of optimizing 

maximize tr [Zt Ma) 
subject to Ma ^ M{a) , 

where Zt is the polynomial matrix of T{x) as defined in Section 13.21 and Ai{a) 
is the set of truncated moment matrices from Section 14.21 Unfortunately, this 
reformulation does not by itself enable us to solve the problem since characterizing 
the set of true truncated moment matrices AA{a) is very difficult. Yet, this rewriting 
suggests a very natural hierarchy of semidefinite programs in which we relax the 
constraint that Ma G M.{a). 

As we have seen in Section we can define interesting sets of matrices M.{a, £) 
for any £ > a that contain all the truncated moment matrices in A4{a). Unlike 
A^(a), checking membership of Ai{a,£) is a semidefinite programming feasibility 
problem and is thus computationally tractable. This observation will allow us to 
write semidefinite programs that place upper bounds on our polynomial optimiza- 
tion. To see that these bounds are actually meaningful, note that we know from 
the de Finetti theorem of Section [5] that for large £ there does exist a measure n 
such that MaiiJ.) ~ Ma when Ma G M{a,£). That is, there exists a true truncated 
moment matrix Ma{fJ.) G A4{a) that closely approximates Ma- This second obser- 
vation will allow us to show that the optimum of these semidefinite programs is 
close to the true optimum v for large £. 

We hence define our level £ relaxation of the polynomial optimization problem 
as follows 

maximize Tr^ZrMa) 
subject to Ma G M{a,£) , 

where Ai{a,£) is defined in Section [4.21 We will refer to the optimum value of 
this semidefinite program at level £ as vi and use M* to denote an optimal solu- 
tion, i.e., tr{ZTMa) = vi. Note that any feasible solution Ma at level £ admits a 
very nice operational interpretation due to the fact that any maximally symmetric 
matrix Ala admits a P-representation (see Section [5]). More precisely, recall that 
from equation (|130p we have Ti{ZTMa) = Jgn-i T{x)PMa{x)dx. That is, the term 
tr{ZpMa) corresponds to an evaluation of the polynomial T{x) at points x on the 
sphere weighted by P},[^{x)dx. These weights are not necessarily positive but if 
Ma were actually a truncated moment matrix Ma G Ai{a) then PMa{x)dx is a 
normalised probability measure on the hypersphere. Since M{a) C M{a,£) we 
have 

f f > (183) 

and so vg is a global upper bound on the polynomial T{x). This upper bound on v 
was originally emphasised by Laserre and Parrilo [531 US] • 

Let us now write out relaxation explicitly as a semidefinite program based on the 
definition of A^(a,^). Define the degree 2£ polynomial T'{x) = r'^^^~''-\x)T{x) and 
let Zt' denote its polynomial matrix. Introducing the variable M G End(Sym(T^®^)), 
we can then write our relaxations explicitly as 

maximize Ti[Zt'M) 
subject to M > , 

tr{M) = 1 , 

(/-n2,)|M) = . 



32 



ANDREW C. DOHERTY AND STEPHANIE WEHNER 



To see the relation to the previous formulation, note that the conditions on M 
guarantee that Ma = tr^Q(7\/) G A4{a, £) which follows from the definition in Section 
EM On the one hand for all Ma G M{a, £) there exists some M G MSym ((R")®^) 
such that Ma = tr ia{M). Thus the feasibility set of the scmidefinite program 
corresponds to the feasibility set J\4{a,£) of our relaxation. On the other hand 
equation ((9T|) implies that X'c{Zt'M) = tr{ZTMa) and so the objective function of 
the two optimizations are also identical. 

The set of semidefinite programming relaxations correspond to the moment ma- 
trix methods of Laserre [26] . The dual semidefinite program [3] has an interpre- 
tation in terms of an optimization over sum of squares of decompositions |33| . It 
is straightforward to compute this dual semidefinite program. The variables are a 
scalar t and a matrix Z G End(Sym(y^^)) and the SDP is as follows 

minimize t 

subject to tl - Zt' + Z >Q , 
n2,|Z) =0 . 

Comparing the conditions of this dual SDP to the sets of polynomials having sum 
of squares decompositions introduced in Section [2231 we find that wc may interpret 
this semi-definite program as follows 

minimize t 

subject to t - T{;x) G 5(a, £) 

which corresponds to the well-known duality between the Laserre and Parrilo points 
of view. This dual semidefinite program can be seen to be strictly feasible |3] , since 
we can choose a feasible point i , Z = where f > is larger than the magnitude of 
the largest negative eigenvalue of Zt' ■ At this feasible point we have H—Zt'+Z > 
and thus the semidefinite program is strictly feasible. As a result the optimum of 
the dual semidefinite program is equal to the primal optimum vi [U Theorem 3.1]. 

7.2. Performance of the relaxation. Having solved the relaxation at level £ we 
would like to know how close we are to the optimal solution v. More precisely, 
having obtained an optimal matrix M* G Ai{a,£) such that tr{M*ZT) = i^i, we 
would like to know how close Vi is to i'. The key to relating to v is given by 
our real valued de Finetti theorem (Theorem I6.2p . This theorem implies that for 
large £, the matrix Af* is very close to a truncated moment matrix M*(/i) and 
as a result must be close to an achievable value of T(x). Specifically there exists 
a representing probability measure fi(x)dx and a corresponding moment matrix 
M*{ii) that is approximately equal to Ma- From equation (|115l) wc have 

v>^i^ 1 T{x)dii{x) =tr{M'*{ii)ZT) ^tr{M*ZT) ^ VI. (184) 

Since vi > v for all levels of the relaxation, we can conclude that for large I we 
have V£ sa v. What is more, this measure can be determined explicitly in terms of 
the Fourier coefficients of the P- representation of M*. In the following we derive 
an explicit bound on v resulting from v^. The condition on £ ensures that the 
approximation parameter e{a,£,n) < 1. 

Theorem 7.1. Let £ > 2a? (a + f ~ l) ~ f ? ind let T{x) be a homogeneous poly- 
nomial of degree 2a with x G S"~^. Let vi he the optimum of the SDP relaxation 
at level £ and v the true optimum of the homogenous polynomial T(x) over the 
hyper sphere S"""^. Then we have 



vi — e(a, £, n)v < v < vi 



(185) 
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whe 



In particular the relative error in approximation is 

^^<6(a,^,n). (187) 

V 

Proof. The upper bound on v, v < vg is true for all £. Since vg is a solution of the 
semidefinite program there is an M* S A4{a,£) with tr{ZTM*) = i/g. Noting that 
||T||oo ~ I' we have by Theorem 16.21 that there exists a measure ii{x)dx on S*""^ 
normalized to unity, and an associated truncated moment matrix M*{fi) such that 



Zt [m: - A/:(m))] I < e{a,i,n),y (188) 

where e{a,£,n) is as defined above and Dg = J^^^i T{x)^{x)dx = tr(M*(/i)ZT) 
(recah (|115|)'). We thus have 



tr 



vt ~ I T(x)fi{x)dx < / ii{x)dx = V < vi , (189) 

where the second equality is just the normalisation of the measure. As a result, 
from equation (|188p we have 

vt-vt<e{a,£,n)v . (190) 

Combining this with the inequality (jl89p we obtain 

t'f — e(a, n)z^ < vg < v (191) 

which is the lower bound on v and we have established (jl85p . The bound on the 
relative error (|187p follows by a straightforward rearrangement of (|185|) . □ 



8. Discussion 

We have shown convergence of the SDP hierarchy of [33] for optimizing ho- 
mogeneous polynomials on the hypersphere. The key ingredient was a so-called real 
valued de Finetti theorem, inspired by tools used in quantum information theory. 
Our approach allowed us to intepret the actual solution of the SDP hierarchy at 
level i as the evaluation of the polynomial T{x) at an afhne combination of points 
on the sphere. This combination was determined by the so-called P-represcntation 
Pm{x). We could then find an explicit representing measure (i.e., a convex combi- 
nation of points on the sphere) by comparing the Fourier coefficients of Pm (x) with 
those of the so-called Q-representation Qm{x) which is always positive. Whereas 
our approach may seem somewhat foreign at first glance compared to prior ap- 
proaches, we find it rather beautiful in that it not only allowed us to interpret 
solutions at any level i, but also gave a very natural way to obtain an approxi- 
mate representing measure. Relations between such P and Q-representations are 
employed in quantum optics, which served as one of the inspirations of our proof. 

It is interesting to note that our approach bears similarities to the proof by Lau- 
rent [17] of the result of Curto and Fialkow for the case of £ oo, and partially 
inspired our proof. In particular, one can loosely understand part of the approach 
of |27j as a "measurement" of the (infinite) moment matrix by interpolation poly- 
nomials. In spirit, this is loosely similar to the "measurement" that we perform 
here to define the Q- function Qm{x). 

One may wonder whether other de Finetti theorems can be proven this way. It 
turns out that this is indeed the case, for example it is possible to recover the usual 



34 



ANDREW C. DOHERTY AND STEPHANIE WEHNER, 



quantum mechanical de Finetti theorem of [7] with similar approximation param- 
eters using our techniques based on spherical harmonics. Using the appropriate 
spherical harmonics for complex projective spaces an analogue of the Funk-Hecke 
formula [3^ can be found. It should be noted that the necessary convolution theo- 
rems analogous to the Funk-Hecke formula can in principle also be found for many 
other Ricmannian symmetric spaces following |39] . opening the door to a whole 
new class of de Finetti theorems. In turn, this would enable a proof of convergence 
for the optimization of polynomials over other Riemannian symmetric spaces. 
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In this appendix we provide some of the more technical components of the real 
De Finetti theorem and our convergence results. 

Appendix A. Technical lemmas 

In our proof, we will need properties of spherical harmonics expressed with the 
help of so-called Gegenbauer polynomials Pj : M. ^ M. These can be expressed in 
terms of the Rodrigues' formula [20] as 

^.(') - (4)' r^T%(' - ii)' <' - '^^'^ ^ (»^) 

In particular, we will later employ the Funk-Hecke formula |30| which involves 
parameters X{n,£,j) as given in (|193p . As these are rather unwieldy by itself, we 
simplify them using the following lemma. 

Lemma A.l. Let £,j,n £ N and n>3, then 

X{n,£,j):=J t^^Pj{t){l-t^)'^dt (193) 

is given by 

^ r(.+i-^)r(.+^) ^fJ^sevenandJ<2£, ^^^^^ 
if j is odd or j > 2£ , 

where Pj{t) is the Gegenbauer polynomial given by the Rodrigues' formula of (|192p . 
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Proof. Using integration by parts, it can be shown that [301 Lemma 11] 

(195) 

From [d / dty (t^^) we immediately see that \j = for j > 21. We may now simphfy 
the right hand side, by noting that {d/dty{t^^) = {2e)(2£ - 1) . . . {2£ - j + l)t^'^-i , 
to 

2) T{j + ^)T{2l+l~j)jJ ^'''^ 

Let us now first consider the case when j is odd. Note that in this case the function 
f{t) := t^^^^{l — t^y^'^ obeys /(— t) = —f{t). Hence, the integral vanishes and 
\j = 0. For the case when j is even, we can use the fact that f{t) is symmetric 
around the origin t = to write the integral in terms of the /3-function to obtain 
(see e.g. Equation 2.6] or [U Section 10.4]) 

^""^(1 - ty^^dt = ^^'-^'l^^j^l'^^ . (197) 

Combining !(T%\\ . and !fWi} we obtain 

r(^)r(2^ + i)r(£- 



A(n,^,j)= - ^ ^ ^ ^ — ■ (198) 

^ '-"^ V2/ r(2^ + i-.7)r(£ + ^) ^ ' 

Noting that by Legendre's duplication formula we can write 

^(^-^) . ^(^-^) . I (199) 

r(2£ + i-j) r(2(£-^)) 22^-jr(£ + i-§) ^ ^ 

then gives our claim. □ 

To form our approximations, we will need to compare ratios of the functions 
\{n,£,j) for larger values of j to the one for j = 0. Given our lemma above we 
obtain the following corollary. 



(200) 



Corollary A. 2. Let £,j,n G N, with n > 3 and j even. Then 

xin,e,j) ^ r(£ + i)r(£ + f) 
A(n,£,o) r(^ + i-|)r(£ + ^) ■ 

Whereas such ratios can of course be evaluated for any given values of n,£ and j, 
it is less obvious what their asymptotic behaviour (large £) would be. The following 
lemma derives a lower bound on said ratio, for which the assymptotics are very easy 
to understand. This will be used in the proof of our real De Finetti theorem. 

Lemma A. 3. Let £,j,n £ N, n > 3, and let 2£ > j > 2 be even. Then 

A(n^^ ,(^-1) 
A(n,£,0)- 2£ + n ' 

Proof. Our goal will be to lower bound x^n'i'o) ^"^^ ^ ~ '^^ with fc e N. Using 
CoroUarv I A . 2 1 we have 

HnJ,j) ^ r(£ + i)r(^ + §) 

x{n,£,o) r{£ + i- k)T {e + ^ + k) ' ^ ' 
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Using the fact that 

T{£+1) £{e-l)...{e+l~k)T{£+l^k) 



r(^ + | + fc) (£ + | + fc-i)...(£ + f)r(£ + f) 



(203) 



we can rewrite (12021) as 



e + 2-k\ ( I \ fe + i-k^'" 



e + ^ + iJ \e+'j + k-ij- \ 



> (204) 



fe factors 

where the first inequality follows from the fact that for n > and fc > 1 we have 
£ + 1 — k<£ + n/2 and hence {£ + I — k)/{£ + n/2) is the smallest term on the 
left hand side of (|204p . This can be seen by noting that for any x,y > the 
fact that X < y implies {x/y){y + l)/(x + 1) = (xy + x)/{xy + y) < 1 and hence 
x/y < (x + l)/(y+l). □ 

Corollary A. 4. Let £,j, n eN, n>3, and let 2£> j >2 be even. Then 
whenever e{i,t,n) < 1. 

Proof. Let a = 1 — \{n,£,j)/\{n,£,Q). Then Lemma [A. 31 implies a < e{j,£,n)/2. 
This gives us 

^H^-l~-l<2«<.a^,.). (207) 
Hn,£,j) l~a 

Which is the required result. While the second inequality in (|207p is just the result 
of Lemma IA.3[ the first inequality follows from the following small calculation 

1 (1 + 2q)(1 - a) - 1 a -2^2 , , 

1 + 2a - = '-^ '- = > 0. (208 

1— a 1 — a 1 — a 

where the inequality holds when a < 1/2 which holds due to Lemma IA.3I and the 
assumption that e{j,t,n) < 1. We also require a > which follows from Lemma 
ED below. □ 

Let us now establish some further properties of the function A(n,£, j). 

Lemma A. 5. Let £,j,n G N, and n > 3. Then for all even j with 2i > j > 2 the 
function l/X{n,i,j) is increasing in j. 



(209) 



Proof. By (|196p we have 

1 _ 22^r(^ + i-|)r(^ + ^; 



X{n,£,j) ^ r(s^)r(2£+l) 

Hence it suffices to show that F + 1 — |) F + -^^j is increasing in j. Since 
2 = 2k is even, this is equivalent to showing that 

r(^ + i-fc) ^ r(^ + § + fc + i) 

F(£-fc) - F(^+f + fc) ■ ^ ' 

Note that F (^ + 1 - fc) = (£ - fc)F - fc) and F (£ + n/2 + fc + 1) = (£ + n/2 + 
fc)F (£ + n/2 + fc), hence (^1(71) is equivalent to 

^-fc<^+- + fc, (211) 
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which is our claim. □ 

Wc win also need to establish that the quantity || • \\pi docs indeed form a norm 
for maximally symmetric matrices. 

Claim A. 6. The quantity defines a norm on the space of maximally sym- 

metric matrices A € MSym ((M")'»^). 

Proof. Clearly, ||cAj|i?i = c||ylj|i?i for any scalar c. To see that the triangle inequal- 
ity holds note that 

\\A + B\\fi = supitriZpA) +tr{ZFB)) (212) 

F 

< suptr(ZFA) + suptr(Zj.B) = \\A\\fi + \\B\\fi ■ (213) 

F F 

To see that ||A||fi = implies that A^O note that any A G MSym ((R")'»^) has 
a P representation fLemma lS.ip 

A^Y^Pf^^L- (214) 

jm 

Furthermore, note that due to Lemmas 15.21 and I A. 1 1 we can restrict the sum to be 
over j < 21 and j is even. However for any ^ choosing F — Sjm {x) yields by 
Lemma[13]tr(ZFA) ^ 0. □ 

Appendix B. Relating optimal solutions of even and odd degree 

We will need the following purely technical lemma when dealing with homoge- 
neous polynomials of odd degree. This will enable us to relate the optmization 
problem of optimizing homogeneous polynomials of odd degree to the optimization 
of homogeneous polynomials of even degree. 

Lemma B.l. For all I e N 

c2^-i (2^-1)^-1/2 
max 7" 7777 = -7-7 . (215) 

Proof. Let g{c,i) := (?^~^ jiX + c^Y . Computing the derivative with respect to c 
we have 

^ - 2g + (2£ - 1)^^ (216) 

dc ~ {1 + c^Y+^ + ^ ^ ^1 + c^Y ■ 

A small calculation shows that this derivative vanishes {— 0) for c = and for c > 
iff 

c V2e~l . (217) 

We now show that this is the maximum. Note that for boundary point c = we 
have g{0,i) = 0. Computing the second derivative and evaluating it at the point 
c = \/2£ — 1 we obtain 

(2^-1)^-1/2 



which is clearly negative. It remains to evaluate g{c, i) at the maximum point which 
yields our claim. □ 

The following lemma will allow us to reduce the case of optimizing homoge- 
neous polynomials of odd degree over 5"-!, to the case of optimizing homogeneous 
polynomials of even degree over the n + I dimensional sphere S" . 
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/(-') ■■= T^T^ ■ (220) 



Lemma B.2. Let T{x) be a homogeneous polynomial of odd degree 2^—1 in vari- 
ables X = (xi, . . . , Xn), and let T'{x') ~ T{x)xo for x' = (xq, cci, . . . , Xn), G M. 
Then 

max T'(x') = -fif) max T{x) , (219) 
w/iere 7(f) = (2£ - l)^-i/2/(£'^20. 

Proof. Note that T'{x') is a homogeneous polynomial of even degree 2£. Define 

{\\A\IY 

Note that ||a;'||| is a homogeneous polynomial of degree 21. Since T'{x') and ||a;'||| 
are homogeneous of degree 2£, we have for all constants k 

f{kx') = /(x') . (221) 

Observe that this implies that 

max f{x') = max f{x') = max T'{x') , (222) 

a:'6K" + i 2;' = (1,2;i,...,2;„)GK" + i a;'GS" 

as we can choose the constant k to normalize the solution of the l.h.s. to have 
xo = 1 or to reside on the hypersphcre (where ||a;'|j2 = 1) without changing the 
optimum. Note the expression in the middle can be rewritten as 

max fix') — max '^ii^lnx. • (223) 

x'=(i,xi,...,x„)m^+^ x£m^{l + \\x\\lY ^ ' 

Optimizing over x S M" is equivalent to optimizing over a vector x S S*""^, and an 
optimization over the length of this vector since we can write any for 
some X E 5"^^ with c = ||x||2. Using the fact that T(x) is homogeneous, we can 
hence express (|223p as 

max max — tt—t- ] = max T(x) max ^r-j ] . (224) 

ses—i c>o V(l + c2)'' y £es"-i ^ ' \c>o {l + c^YJ ^ ' 

" V ' 

=7(^) 

Our claim now follows by applying Lemma IB. II to evaluate 7(^). □ 
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